Fundamental Frequency Estimation for Noisy Speech Using Entropy-Weighted Periodic and Harmonic Features

نویسندگان

  • Yuichi Ishimoto
  • Kentaro Ishizuka
  • Kiyoaki Aikawa
  • Masato Akagi
چکیده

SUMMARY This paper proposes a robust method for estimating the fundamental frequency (F0) in real environments. It is assumed that the spectral structure of real environmental noise varies momentarily and its energy does not distribute evenly in the time-frequency domain. Therefore, segmenting a spec-trogram of speech mixed with environmental noise into narrow time-frequency regions will produce low-noise regions in which the signal-to-noise ratio is high. The proposed method estimates F0 from the periodic and harmonic features that are clearly observed in the low-noise regions. It first uses two kinds of spec-trogram, one with high frequency resolution and another with high temporal resolution, to represent the periodic and harmonic features corresponding to F0. Next, the method segments these two kinds of feature plane into narrow time-frequency regions, and calculates the probability function of F0 for each region. It then utilizes the entropy of the probability function as weight to emphasize the probability function in the low-noise region and to enhance noise robustness. Finally, the probability functions are grouped in each time, and F0 is obtained as the frequency with the highest probability of the function. The experimental results showed that, in comparison with other approaches such as the cepstrum method and the autocorrelation method, the developed method can more robustly estimate F0s from speech in the presence of band-limited noise and car noise.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bulletin of the Polish Academy of Sciences

A stable and accurate estimation of the fundamental frequency (pitch, F0) is an important requirement in speech and music signal analysis, in tasks like automatic speech recognition and extraction of target signal in noisy environment. In this paper, we propose a pitch-related spectrogram normalization scheme to improve the speaker – independency of standard speech features. A very accurate est...

متن کامل

New Time-frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of Snr

New Time-Frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of SNR Celia Shahnaz, Ph.D. Concordia University, 2009 Pitch estimation of speech signals is the key to understanding most acoustical phenomena as well as accurately designing many practical systems in speech communication. It is to determine the fundamental frequency or period of a vocal cord vibration causi...

متن کامل

Adaptive comb filtering for harmonic signal enhancement

A new algorithm is presented for adaptive comb filtering and parametric spectral estimation of harmonic signals with additive white noise. The algorithm is composed of two cascaded parts. The first estimates the fundamental frequency and enhances the harmonic component in the input, and the second estimates the harmonic amplitudes and phases. Performance analysis provides new results for the as...

متن کامل

Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure

In this paper, we propose a new scheme to analyze the spectral structure of speech signals for fundamental frequency estimation. First, we propose a pitch measure to detect the harmonic characteristics of voiced sounds on the spectrum of a speech signal. This measure utilizes the properties that there are distinct impulses located at the positions of fundamental frequency and its harmonics, and...

متن کامل

Single complex sinusoid and ARHE model based pitch extractors

In this paper we propose two techniques for the estimation of the fundamental frequency of speech signals. The rst technique is based on the Autoregressive Harmonic Excitation (ARHE) speech model. ARHE model consists of an autoregressive process driven simultaneously by white noise and a periodic excitation. The second technique is based on the estimation of a complex sinusoid in white Gaussian...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEICE Transactions

دوره 87-D  شماره 

صفحات  -

تاریخ انتشار 2004